Lexical bundles in computational linguistics academic literature

نویسنده

  • Adel Rahimi
چکیده

Corpus linguistics has been in the spotlight for the last decade, with the usage of modern computers and technologies deeper understanding of languages can be obtained. Corpus linguistics helps language teaching for acquiring a better view for language. Language teachers will know what sequence of words and patters tend to co-occur. Unlike previous enormous lists of words which students were forced to memorize them, and were seldom used and typically were forgotten in a short period of time lexical bundles can help language teachers to teach more effectively, and learners can be more fluent in the second language. The first studies in lexical bundles include Firth 1964 (Firth, J. R. (1964).) Biber (2004), (Cortes, V. (2004). Native speakers use a formulaic pattern of speech which they are unaware of but learners from other languages use bundles that are affected (transferred) bye their mother tongue and this solely can be problematic and easily recognizable. moreover in academia, when speakers of source language trying to write, publish, and produce academic literature in the target language their lack the fluency and native like features of a native speaker of that language. Lexical bundles (or as called N-grams) are crucial in getting a fluent academic text. In this paper lexical bundles of 1 to 5 tokens from an 8 million word corpus of academic literature from the Computational Linguistics field and its sub topics such as: Speech recognition, Natural Language Processing, Machine Learning, and Information Retrieval have been extracted and analyzed. On the top of that most of typical criteria for exclusion has been applied to the list as well as calculating MI factor for each result to confirm the results and reaching the target bundles for Computational Linguistics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ACADEMIC WRITING REVISITED: A PHRASEOLOGICAL ANALYSIS OF APPLIED LINGUISTICS HIGH-STAKE GENRES FROM THE PERSPECTIVE OF LEXICAL BUNDLES

Lexical bundles are frequent word combinations that commonly appear in different registers. They have been the subject of much research in the area of corpus linguistics during the last decade. While most previous studies of bundles have mainly focused on variations in the use of these word combinations across different registers and a number of disciplines, not much research has been done to e...

متن کامل

Published vs. Postgraduate Writing in Applied Linguistics: The Case of Lexical Bundles

Abstract: Lexical bundles, as building blocks of coherent discourse, have been the subject of much research in the last two decades. While many of such studies have been mainly concerned with  exploring  variations  in  the  use  of  these  word  sequences  across  different  registers  and disciplines, very few have addressed the use of some particular groups of lexical bundles within some gen...

متن کامل

The Use of Lexical Bundles in Native and Non-native Post-graduate Writing: The Case of Applied Linguistics MA Theses

Connor et al. (2008) mention “specifying textual requirements of genres” (p.12) as one of the reasons which have motivated researchers in the analysis of writing. Members of each genre should be able to produce and retrieve these textual requirements appropriately to be considered communicatively proficient. One of the textual requirements of genres is regularities of specific forms and content...

متن کامل

A Comparative Study of Lexical Bundles in Soft Science Articles Written by Native and Iranian Authors

Writing academic texts by novice researchers requires a framework and support by learning how to cite the works of others. However, compared to the studies on other academic writings, studying citations by considering certainty markers has received little attention. The main purpose of this study was to investigate the shifts of certainty markers (hedges and boosters) in pre- and post-citation ...

متن کامل

Lexical Bundles in English Abstracts of Research Articles Written by Iranian Scholars: Examples from Humanities

This paper investigates a special type of recurrent expressions, lexical bundles, defined as a sequence of three or more words that co-occur frequently in a particular register (Biber et al., 1999). Considering the importance of this group of multi-word sequences in academic prose, this study explores the forms and syntactic structures of three- and four-word bundles in English abstracts writte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1603.02905  شماره 

صفحات  -

تاریخ انتشار 2016